chore: promote staging to staging-promote/b952d229-23331469361 (2026-03-20 07:18 UTC) by ironclaw-ci[bot] · Pull Request #1459 · nearai/ironclaw

ironclaw-ci · 2026-03-20T07:18:20Z

Auto-promotion from staging CI

Batch range: c4ab382522c86e7e19d55fee760b125fb1970518..c17626160ce956a5e7c64a59b3e65c1801fee21f
Promotion branch: staging-promote/c1762616-23332963145
Base: staging-promote/b952d229-23331469361
Triggered by: Staging CI batch at 2026-03-20 07:18 UTC

Commits in this batch (12):

cac6f40 Add owner-scoped permissions for full-job routines (Add owner-scoped permissions for full-job routines #1440)
6b0f84b perf: use Arc in embedding cache to avoid clones on miss path (perf: use Arc in embedding cache to avoid clones on miss path #1438)
8920322 fix: staging CI triage — consolidate retry parsing, fix flaky tests, add docs (fix: staging CI triage — consolidate retry parsing, fix flaky tests, add docs #1427)
8526cde fix: restore libSQL vector search with dynamic dimensions (fix: restore libSQL vector search with dynamic dimensions #1393)
455f543 fix(routines): surface errors when sandbox unavailable for full_job routines (fix(routines): surface errors when sandbox unavailable for full_job routines #769)
3a52334 fix: f32→f64 precision artifact in temperature causes provider 400 errors (fix: f32→f64 precision artifact in temperature causes provider 400 errors #1450)
806d402 feat: chat onboarding and routine advisor (feat: chat onboarding and routine advisor #927)
31c3b5b feat(agent): activate stuck_threshold for time-based stuck job detection (feat(agent): activate stuck_threshold for time-based stuck job detection #1234)
ef3d769 fix(security): validate embedding base URLs to prevent SSRF (fix(security): validate embedding base URLs to prevent SSRF #1221)
b952d22 fix: prefer execution-local message routing metadata (fix: prefer execution-local message routing metadata #1449)
e82f4bd fix: register sandbox jobs in ContextManager for query tool visibility (fix: register sandbox jobs in ContextManager for query tool visibility #1426)
c176261 fix: skip credential validation for Bedrock backend (fix: skip credential validation for Bedrock backend #1011)

Current commits in this promotion (2)

Current base: staging-promote/b952d229-23331469361
Current head: staging-promote/c1762616-23332963145
Current range: origin/staging-promote/b952d229-23331469361..origin/staging-promote/c1762616-23332963145

e82f4bd fix: register sandbox jobs in ContextManager for query tool visibility (fix: register sandbox jobs in ContextManager for query tool visibility #1426)
c176261 fix: skip credential validation for Bedrock backend (fix: skip credential validation for Bedrock backend #1011)

Auto-updated by staging promotion metadata workflow

Waiting for gates:

Tests: pending
E2E: pending
Claude Code review: pending (will post comments on this PR)

Auto-created by staging-ci workflow

#1426) * fix: register sandbox jobs in ContextManager for query tool visibility Sandbox jobs created via execute_sandbox() were persisted to the database but never registered in the in-memory ContextManager. Since all query tools (list_jobs, job_status, job_events, cancel_job) only search the ContextManager, sandbox jobs were invisible to the agent despite running successfully in Docker containers. Changes: - Add register_sandbox_job() to ContextManager (pre-determined UUID, starts InProgress, respects max_jobs) - Extract insert_context() helper to deduplicate create_job_for_user and register_sandbox_job - Add update_context_state / update_context_state_async to sync ContextManager state on sandbox job completion/failure - Extend job_monitor with spawn_job_monitor_with_context() and spawn_completion_watcher() so fire-and-forget jobs transition out of InProgress when the container finishes - Make CancelJobTool sandbox-aware (stops container + updates DB) - Wire sandbox deps into CancelJobTool in register_job_tools() - 8 regression tests across context manager and job monitor Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: add missing allow_always field in PendingApproval test literal Upstream commit 09e1c97 added the allow_always field to PendingApproval but missed updating the test struct literal, breaking compilation. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Bedrock uses IAM credentials (instance roles, env vars, SSO) resolved by the AWS SDK at call time, so `provider` is never set during startup. Exclude it from the post-init validation that checks for missing API keys. Closes #1009 Co-authored-by: brajul <brajul@amazon.com> Co-authored-by: Illia Polosukhin <ilblackdragon@gmail.com>

* channels/wasm: implement telegram broadcast path for message tool * channels/wasm: tighten telegram broadcast contract and tests * fix: resolve merge conflicts with staging for wasm broadcast - Remove duplicate broadcast() impls from WasmChannel and SharedWasmChannel (staging already has the generic call_on_broadcast path) - Remove obsolete telegram-specific test helpers and tests that tested the old telegram-only broadcast logic - Add test_broadcast_delegates_to_call_on_broadcast for the generic path - Fix missing fallback_deliverable field in job_monitor test SseEvents Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: davidpty <127684147+davidpty@users.noreply.github.com> Co-authored-by: firat.sertgoz <f@nuff.tech> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat(web): add light theme with dark/light/system toggle (#761) Add three-state theme toggle (dark → light → system) to the Web Gateway: - Extract 101 hardcoded CSS colors into 30+ CSS custom properties - Add [data-theme='light'] overrides for all variables - Add theme toggle button in tab-bar (moon/sun/monitor icons) - Theme persists via localStorage, defaults to 'system' - System mode follows OS prefers-color-scheme in real-time - FOUC prevention via inline script in <head> - Delayed CSS transition to avoid flash on initial load - Pure CSS icon switching via data-theme-mode attribute Closes #761 * fix: address review feedback and code improvements (takeover #853) - Fix dark-mode readability bug: .stepper-step.failed and .image-preview-remove used --text-on-accent (#09090b) on var(--danger) background, making text unreadable. Changed to --text-on-danger (#fff). - Restore hover visual feedback on .image-preview-remove:hover using filter: brightness(1.2) instead of redundant var(--danger). - Use const/let instead of var in theme-init.js for consistency with app.js (per gemini-code-assist review feedback). Co-Authored-By: CPU-216 <3125034290@stu.cpu.edu.cn> Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address CI failures and Copilot review feedback (takeover #853) - Fix missing `fallback_deliverable` field in job_monitor test constructors (pre-existing staging issue surfaced by merge) - Validate localStorage theme value against whitelist in both theme-init.js and app.js to prevent broken state from invalid values - Add matchMedia addEventListener fallback for older Safari/WebKit - Add i18n keys for theme tooltip and aria-live announcement strings (en + zh-CN) to match existing localization patterns - Move .sr-only utility from inline <style> to style.css [skip-regression-check] Co-Authored-By: CPU-216 <3125034290@stu.cpu.edu.cn> Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Gao Zheng <3125034290@stu.cpu.edu.cn> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…1461) * feat(llm): add OpenAI Codex backend config and OAuth session manager Add OpenAiCodex as a new LLM backend variant with config for auth endpoint, API base URL, client ID, and session persistence path. The session manager implements OpenAI's device code auth flow (headless-friendly, no browser required on the server) with automatic token refresh, following the same persistence pattern as the existing NEAR AI session manager. Closes #742 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat(llm): add Responses API client and token-refreshing decorator Native Responses API client for chatgpt.com/backend-api/codex/responses, the endpoint that works with ChatGPT subscription tokens. Handles SSE streaming, text completions, and tool call round-trips. Token-refreshing decorator wraps the provider to pre-emptively refresh OAuth tokens before API calls and retry once on auth failures. Reports zero cost since billing is through subscription. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat(llm): wire OpenAI Codex into provider factory, CLI, and setup wizard Connect the new provider to the LLM factory, add openai_codex to the CLI --backend flag, and add it as an option in the onboarding wizard. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(llm): address PR #744 review feedback (20 items) Review fixes for the OpenAI Codex provider PR: - Remove dead `generate_pkce()` code (device flow gets PKCE from server) - Fix `refresh_tokens()` to use `.form()` instead of `.json()` per OAuth spec - Inline codex dispatch into `build_provider_chain()` (single async function, no separate `assemble_provider_chain()` helper — matches main's pattern) - Remove Clone from `OpenAiCodexSession`, restrict fields to `pub(crate)` - Propagate HTTP client builder error instead of silent fallback - Redact device code response body from debug log - Change `set_model()` in TokenRefreshingProvider to delegate to inner - Replace hardcoded `/tmp/` test path with `tempfile::tempdir()` - Accept `request_timeout_secs` from config instead of hardcoded 300s - Parse `Retry-After` header on 429 responses (matches nearai_chat.rs pattern) - Reuse `normalize_schema_strict()` for Codex tool definitions - Add warning log for dropped image attachments - Add doc comments on `list_models()` and `include` field - Add `OPENAI_CODEX_API_URL` to `.env.example` - Fix codex error message in `create_llm_provider()` for clarity - Revert unrelated `.worktrees` addition to `.gitignore` - Update `src/llm/CLAUDE.md` with Codex provider docs [skip-regression-check] Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: address review feedback and harden OpenAI Codex provider (takeover #744) Security: - Add SSRF validation (validate_base_url) on OPENAI_CODEX_AUTH_URL and OPENAI_CODEX_API_URL, matching the pattern used by all other base URL configs (regression test for #1103 included) Correctness: - Add missing cache_write_multiplier() and cache_read_discount() trait delegation in TokenRefreshingProvider - Cap device-code polling backoff at 60s to prevent unbounded interval growth on repeated 429 responses - Default expires_in to 3600s when server returns 0, preventing immediately-expired sessions - Fix pre-existing SseEvent::JobResult missing fallback_deliverable field in job_monitor.rs tests Cleanup: - Extract duplicated make_test_jwt() and test_codex_config() into shared codex_test_helpers module Co-Authored-By: Sanjeev-S <Sanjeev-S@users.noreply.github.com> Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: address PR review feedback on OpenAI Codex provider (#1461) - Login command now resolves OPENAI_CODEX_* env overrides even when LLM_BACKEND isn't set to openai_codex (Copilot review) - Setup wizard "Keep current provider?" for codex no longer re-triggers device code login — mirrors Bedrock's keep-and-return pattern (Copilot) - Revert provider init log from info back to debug (Copilot) - Add warning log when token expires_in=0, before defaulting to 3600s (Gemini review) Co-Authored-By: Sanjeev-S <Sanjeev-S@users.noreply.github.com> Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Sanjeev Suresh <Sanjeev-S@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

…7636 chore: promote staging to staging-promote/cba1bc37-23334371795 (2026-03-20 16:12 UTC)

…1795 chore: promote staging to staging-promote/c1762616-23332963145 (2026-03-20 08:09 UTC)

vnz and others added 2 commits March 19, 2026 23:22

ironclaw-ci bot added the staging-promotion label Mar 20, 2026

github-actions bot added scope: agent Agent core (agent loop, router, scheduler) scope: tool Tool infrastructure scope: tool/builtin Built-in tools size: XL 500+ changed lines risk: medium Business logic, config, or moderate-risk modules contributor: core 20+ merged PRs labels Mar 20, 2026

ilblackdragon and others added 5 commits March 20, 2026 00:41

Merge pull request #1466 from nearai/staging-promote/3da9810e-2335168…

ddf64e8

…7636 chore: promote staging to staging-promote/cba1bc37-23334371795 (2026-03-20 16:12 UTC)

Merge pull request #1462 from nearai/staging-promote/cba1bc37-2333437…

0194275

…1795 chore: promote staging to staging-promote/c1762616-23332963145 (2026-03-20 08:09 UTC)

henrypark133 merged commit bb57e36 into staging-promote/b952d229-23331469361 Mar 23, 2026
11 of 13 checks passed

github-actions bot added scope: channel/cli TUI / CLI channel scope: channel/web Web gateway channel scope: channel/wasm WASM channel runtime scope: llm LLM integration scope: setup Onboarding / setup scope: docs Documentation labels Mar 23, 2026

henrypark133 deleted the staging-promote/c1762616-23332963145 branch March 23, 2026 19:00

github-actions bot added risk: high Safety, secrets, auth, or critical infrastructure and removed risk: medium Business logic, config, or moderate-risk modules labels Mar 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: promote staging to staging-promote/b952d229-23331469361 (2026-03-20 07:18 UTC)#1459

chore: promote staging to staging-promote/b952d229-23331469361 (2026-03-20 07:18 UTC)#1459
henrypark133 merged 7 commits intostaging-promote/b952d229-23331469361from
staging-promote/c1762616-23332963145

ironclaw-ci bot commented Mar 20, 2026 •

edited by github-actions bot

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ironclaw-ci bot commented Mar 20, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Auto-promotion from staging CI

Commits in this batch (12):

Current commits in this promotion (2)

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ironclaw-ci bot commented Mar 20, 2026 •

edited by github-actions bot

Loading